Amazon products reviews classification based on machine learning, deep learning methods and BERT

نویسندگان

چکیده

In recent times, the trend of online shopping through e-commerce stores and websites has grown to a huge extent. Whenever product is purchased on an platform, people leave their reviews about product. These are very helpful for store owners product’s manufacturers betterment work process as well quality. An automated system proposed in this that operates two datasets D1 D2 obtained from Amazon. After certain preprocessing steps, N-gram word embedding-based features extracted using term frequency-inverse document frequency (TF-IDF), bag words (BoW) global vectors (GloVe), Word2vec, respectively. Four machine learning (ML) models support vector machines (SVM), logistic regression (RF), (LR), multinomial Naïve Bayes (MNB), deep (DL) convolutional neural network (CNN), long-short memory (LSTM), standalone bidirectional encoder representations (BERT) used classify either positive or negative. The results by standard ML, DL BERT evaluated performance evaluation measures. turns out be best-performing model case with accuracy 90% derived embedding while CNN provides best 97% upon D2. shows better overall compared D1.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Sentiment Classification of Spanish Reviews: An Approach based on Feature Selection and Machine Learning Methods

Sentiment analysis aims to extract users’ opinions from review documents. Nowadays, there are two main approaches for sentiment analysis: the semantic orientation and the machine learning. Sentiment analysis approaches based on Machine Learning (ML) methods work over a set of features extracted from the users’ opinions. However, the high dimensionality of the feature vector reduces the effectiv...

متن کامل

on the comparison of keyword and semantic-context methods of learning new vocabulary meaning

the rationale behind the present study is that particular learning strategies produce more effective results when applied together. the present study tried to investigate the efficiency of the semantic-context strategy alone with a technique called, keyword method. to clarify the point, the current study seeked to find answer to the following question: are the keyword and semantic-context metho...

15 صفحه اول

Amazon Food Review Classification using Deep Learning and Recommender System

In this paper we implemented different models to solve the review usefulness classification problem. Both feed-forward neural network and LSTM were able to beat the baseline model. Performances of the models are evaluated using 0-1 loss and F-1 scores. In general, LSTM outperformed feed-forward neural network, as we trained our own word vectors in that model, and LSTM itself was able to store m...

متن کامل

Body Mass Index Classification based on Facial Features using Machine Learning Algorithms for utilizing in Telemedicine

Background and Objectives: Due to the impact of controlling BMI on life, BMI classification based on facial features can be used for developing Telemedicine systems and eliminating the limitations of measuring tools, especially for paralyzed people. So that physicians can help people online during the Covid-19 pandemic. Method: In this study, new features and some previous work features were e...

متن کامل

EEG Classification based on Machine Learning Techniques

The main issue to build applicable Brain-Computer Interfaces is the capability to classify the electroencephalograms (EEG). During the last decade, researchers developed lots of interests in this field. The purpose behind this research is to improve a model for EEG signals analysis. Filtration of EEG Signals is essential to remove artifacts. Otherwise, wavelet transform was used to extract feat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: TELKOMNIKA Telecommunication Computing Electronics and Control

سال: 2023

ISSN: ['1693-6930', '2302-9293']

DOI: https://doi.org/10.12928/telkomnika.v21i5.24046